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System for modifying the time-base of a video signal 



FIELD OF THE INVENTION 

The invention relates to a system for modifying a time-base of a digital video 
stream; the system including an encoding device and a storage device connected via a digital 
video communication system, and a decoder/renderer. The invention further relates to a 
storage device and a time-base modifier. The invention also relates to a method of modifying 
a time-base of a digital video stream and to software for executing the method. 

BACKGROUND OF THE INVENTION 

Nowadays, most video signals have been recorded on tapes in an analog form. 
The quick uptake of digital video for playback, for example using DVD, has as a 
consequence that demand for recording video material on a digital recording medium also 
increases, for example using a recordable optical disc, like DVD+RW, or hard disc based 
recording devices. A commonly used digital encoding format is MPEG, in particular 
MPEG2. For recording an analog video signal, the user uses an analog r^roducing device, 
such as a VCR and Camcorder to generate the analog video signal and uses an (MPEG) 
encoder to encode this signal to a digital format for storing on a digital storage medium. The 
MPEG encoder may be a stand-alone device, or integrated ui another device, like the VCR, 
television, etc. Via a digital video communication system, such as the PI 394 bus, flie 
encoded signal can be suppUed to a decoder/renderer or to a digital storage device for 
subsequent supply to a decoder/renderer. To enable decoding/rendering the encoded signal 
must comply with the encoding/transmission standard, such as MPEG, MPEG video 
encoding/transmission is, among others, described in hitemational Standard ISO/IEC 13818- 
2: 1995(E) Information Technology; Generic coding of moving pictures and associated 
information: Video and International Standard ISO/IEC 13818-1: 1996(E) Information 
Technology; Generic coding of moving pictures and associated information: Systems. 
Especially tuning of the stream of digital video is important To this end, digital video 
encodmg/transmission standards specify clock signals that enable clocks in the encoder and 
decoder to run m a synchronized manner. These clock signals will be referred to as program 
clock reference stamps (PCR), Additionally, thnmg signals are supplied for the individual 
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frames to enable a rendering device to render the frames at the correct time. These timing 
signals will be referred to as presentation time stamps (PTS). Moreover timing signals can be 
present that indicate to the decoder when the MPEG decoding can start. These signals will be 
referred to as decoding time stamps (DTS). The analog or digital video signal suppUed to an 
encoder in addition to the video signal also includes a frame signal (vsync) that indicates 
frame boundaries in the video signal. The frame signal is thus synchronous with the video 
frames. The PTS signals in the encoded stream are usually inserted in the encoded video 
stream synchronous with the arrival of the vsync signal. 

For MPEG2 transmission as specified for DVB (Digital Video Broadcast), the 
timing for the PGR and PTS are based on a same clock with a defined fi^quency and 
tolerance. For example, the clock defined by MPEG runs at 27 MHz. and should deviate less 
than 30 ppm (parts per miUion). Usually, the firame rate of a video signal is 25 Hz. or 30 Hz. 
and the vsync pulse has the same frequency. A straightforward conversion of an analog 
video signal having a frame rate of exactly 25 Hz. will result in an MPEG stream with PTS 
values present once every 40 ms. However, the deviation of the frame rate of a video signal 
reproduced with analog reproducing devices may be up to 100 ppm. To be able to obtain a 
digital video signal which has a PTS once every frame period, the MPEG encoder could be 
locked to the vsync of the video input signal. The encoded digital video signal thus obtained 
will have a PGR which deviates 100 ppm and is therefore not MPEG compliant. Such an 
encoded video stream should not be supplied throug^i a network to a decoder/renderer as the 
decoding device may not be able to decode the signal. On the other hand, directly using the 
MPEG clock and ignoring too large deviations in frame rate may result in rendering 
problems, such as having to skip or repeat frames. 

SUMMARY OF THE DSrVENTION ' 

It is an object of the invention to provide a system that enables a user to render 
an encoded video signal in real-time, possibly with a loss in quality, whereas the signal can 
be perfectly reproduced after having been recorded by a storage device. 

To meet an object of the invention, a system for modifying a time-base of a 
digital video stream includes an encoding device and a storage device connected via a digital 
video communication system, and a decoder/renderer; 

the encoding device including: an input for receiving an input video signal, 
and an input firame signal (vsync) synchronous to and indicating frame boundaries in the 
input video signal; an encoder for converting the input video signal and input fi^me signal to 
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a corresponding digital video stream complying with a predetermined video encoding 
standard; the encoder including a encoder clock complying with the video encoding standard; 
the digital video stream including: a sequence of digital video frames corresponding to 
received input video frames; a sequence of program clock reference stamps (PGR) 
representing a clock signal of the encoder clock; and a sequence of video presentation time 
stamps (PTS) each associated with a respective digital video frame and representing a value 
of a counter driven by the clock signal at a moment of receipt of an input video frame that 
corresponds to the digital video frame; and an output for providing the digital video stream 
via the digital video communication system; 

the storage device including: an inpnt for receiving the digital video stream via 
the digital video communication system; a time-base modifier operative to replace the video 
presentation time stamps (PTS) by respective modified video presentation time stamps (M- 
PTS) based on a constant predetermined frame time between successive presentation time 
stamps; and to replace the program clock reference stamps (PGR) by respective modified 
program clock reference stamps (M-PCR) by scaling the program clock reference stanq)s 
(PGR) using a scaling factor that depends on a ratio of an expected time between a video 
presentation time stampy and a preceding video presentation time stamp j-n and an actual 
time between the video presentation time stampy and the preceding video presentation time 
stampy-n, wherey ^>0, and the expected time is n times the predetermined frame time; 

a storage for storing at least a part of the time-base modified video stream; and 
an output for providing a video stream &om the storage device to the 
decoder/rraderer; and 

the decoder/renderer including an input for receiving a video stream from the 
storage device and being operative to decode the video stream received from the storage 
device to enable rendering of the digital video frames in the stream synchronous wifli the 
respective associated modified video presentation time stamps. 

In the system according to the mvention, the encoded video stream folly 
complies with the video encoding standard with respect to timing requirements for PGR and 
PTS. As such, a standard decoder/renderer that receives the stream through a communication 
interface can decode and render the stream in real-time, albeit at a possible loss of quality 
since the decoder may occasionally need to skip a fi^e (if the actual fi^e rate of the input 
video signal is higher then the maximum allowed deviation) or repeat a fi^e (if the actual 
frame rate of the input video signal is lower then the maximum allowed deviation). All video 
frames are present in the stream. A storage device is used to record fimnes of the stream. The 
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frames can be supplied from the storage at a coirected frame rate that falls within the 
tolerances defined by the video encoding standard. To this end, the time base is scaled by 
comparing the actual frame times with the expected frame times. The actual frame times are 
derived from the PTS. The expected frame times are fixed, e.g. 40 msecs. between successive 
fr^es for a 25 Hz. system and thus the expected firame time between frames j-n and J that 
are sequentially n frames apart («>0) is nMO msecs, for such a firame rate. The actual frame 
time between those frames is determined by PTS[/]-PTSI>-7i] (after conversion of a clock 
number to a corresponding duration). Comparing the actual &amG times to the expected 
firame times gives a scaling factor. The PGR values can be scaled using this factor. Similarly, 
also the DTS values should be scaled if they are present. After this scaling, the offset between 
successive PTS values should correspond to the fixed firame time. Scaling may be done on 
two immediately successive PTS values (in this case »=1), giving a highly dynamic 
adjustment. Scaling may also be done on a larger time, e.g. 5 frames apart («=5) or even all 
frames since the start (n=J, if the first frame is numbered 0). The storage acts as a buffer to 
compensate for the differences in actual frame rate and the constant desired frame rate of tiie 
encoding standard. Such a buffer needs to be substantially larger than required for jitter 
compensation as defined by the video encoding standard. The time-base correction according 
to the invention can be performed using a digital filter that calculates the modified PGR 
values. 

According to the measure of the dependent claim 2, in an embodiment the 
time-base modifier includes a clock unit operative to generate a clock signal locked to the 
received video presentation time stamps (PTS) using an error signal that depends on the 
scaling fector; the time-base modifier being operative to obtain the modified program clock 
reference stamps (M-PCR) by sampling a counter driven by the clock signal at a moment of 
receipt of the program clock reference (PGR). By locking the clock to the PTS signal, the 
clock signal is automatically scaled to the actual PTS signal compared to a PTS signal that is 
based on a constant, standard-defined frame rate. The scaled clock can be used to replace the 
original PGR signal. 

According to the measure of the dependent claim 3, the received video 
presentation time stamps are low-pass filtered and the clock unit is locked to the filtered 
video presentation time stamps. In this way, higih frequency jittering is removed, providing a 
more stable time-base correction. Alternatively, according to the measure of the dependent 
claim 4, the time-base modifier is operative to low-pass filter the scaling factor. 
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According to the measure of the dependent claim 5, the digital video stream 
includes information on a nominal frame rate of the video signal and the time-base modifier 
is operative to derive the predetermined frame time from the digital video stream. The MPEG 
transport stream already includes this information (indicated by the so-called frame rate code 
5 in the sequence header). In this way, the time-base modification can be used for differing 
frame rates without a user having to configure the frame rate. 

According to the measure of the dependent claim 6, the input of the encoding 
device is operative to receive an analog audio signal; the encoding device fiirther including a 
sampler for sampling the received analog audio signal under control of a sampling clock 

10 signal that is derived from the video input signal and locked onto the input firame signal 

(vsync); and wherein the encoder is operative to convert the sampled audio signal into a time 
sequence of corresponding audio frames and insert the audio frames and respective audio 
presentation time stamps (A-PTS) in the digital video signal stream. In this way, the audio 
signal is sampled synchronous to the video signal, enabling synchronous reproduction by a 

1 5 decoder/renderer that decodes the stream in real-time. 

According to the measure of the dependent claim 7, wherein the time-base 
modifier is operative to replace the audio presentation time stamps (A-PTS) by modified 
audio presentation time stamps (MA-PTS) by scaling the audio presentation time stamps (A- 
PTS) using the scaling factor. The encoded audio signal is scaled in the same way as the 

20 video PGR (thus depending on the scaling determined for the video signal). In this way, after 
reproduction from the storage, the audio signal can still be reproduced synchronous to the 
video signal. 

According to the measure of the dependent claim 8, the storage device is 
operative to time stamp each packet of the digital video stream on receipt of the packet; to 

25 store each time stamp in the storage in association with the corresponding received packet; 
and to output packets of the stored stream according to the respective time stamps and a 
predetermined delay. By time stamping each packet, the storage device can issue the packets 
from storage at the desired instants. 

According to the measure of the dependent claim 9, the storage device 

30 includes a clock for providing timing signals and the storage device being operative to use as 
the time stamps stored in the storage the tutning signal scaled using the scaling factor. The 
storage time stamps are thus scaled in the same way as the video PGR time stamps. In this 
way, packets that do not have a PGR value can still be released from the storage at the 
corrected time. 
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According to the measure of the dependent claim 10, the storage device is 
operative to use as the time stamps stored in the storage a counter value from a counter 
driven by the clock signal locked to the received video presentation time stamps (PTS). In 
this way, the akeady scaled PGR signal is also used to time stamp packets that do not have a 
PGR value, giving an accurate time for releasing the packets from storage. 

According to the measure of the dependent claim 1 1, the video encoding 
standard is MPEG2. This encoding/transmission standard is widely supported. 

According to the measure of the dependent claim 12, the digital video 
communication system includes an isochronous coromunication channel for transferring the 
digital video stream. This gives a reliable transmission of the video stream. Preferably, 
IEEE1394 or USB are used. 

These and other aspects of the invention are apparent from and will be 
elucidated with refermce to the embodiments described hereinafter. 

BRIEF DESCRIPTION OF THE DRAWINGS 
In the drawings: 

Fig.l shows a block diagram of the system according to the invention; 

Fig.2 shows an embodiment of an encoder according to the invention; 

Fig.3 shows a fiirther embodiment of an encoder; 

Fig.4 shows a block diagram of a conceptual recoder; 

Fig.5 shows an embodiment of the storage device; 

Fig.6 shows an alternative embodiment of the storage device; 

Fig.7 and 8 illustrate the adjustment to the time-base; and 

Fig.9 illustrates the locking of a PLL to vsync. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

Fig. 1 shows a block diagram of the system according to the invention. The 
system includes an encoding device 10, a storage device 20 and a decoder/renderer 30. The 
encoding device 10 includes an input 12 for receiving at least one video signal. In the 
example of Fig.l, it receives a video signal v.s. and an audio signal a.s. The encoding device 
10 also includes an encoder 14 for encodmg die AV signal(s) to an encoded stream for supply 
to at least the storage device 20 via a digital communication system 40. The encoding and 
transfer is accordmg to a predetermined encoding standard. In a preferred embodiment, it 
complies with MPEG2. However, persons skilled in the art will be able to ^ply the 
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principles also to other encoding/transmission formats. Any suitable communication system 
may be used. To enable stream-wise pushing of the data, the communication system 
preferably provides at least one isochronous commimication channel. Suitable 
communication systems include, but are not limited to, USB and, in particular, IEEE 1394 
(also known as Firewire or iLink). By using standardized encoding/transmission, the stream 
transferred from the encoder to the storage device can also be supplied to a decoder/renderer 
50 for real-time rendering. The decoder/renderer 50 may be connected to the same channel 
40 or to an additional channel (not shown). In Fig.l, the decoder/renderers 30 and 50 are each 
shown as one block. It will be appreciated that such functionality may equally well be 
divided into separate devices. 

The encoding device 10 may be a stand-alone device but may also be 
integrated with other functions. For example, the? encodmg device 10 may be part of a 
television, set-top box (STB), or personal computer (PC). In particular, the encodmg device 
10 may be integrated in a source device of the AV signal, such as a VCR or Camcorder. 

The storage device 20 includes a time-base modifier 22 and a storage 24. The 
storage 24 is used for storing at least part of the time-base modified video stream. The 
storage 24 acts at least as a buffer to compensate for the difference in frame rate of the video 
signal coming into the encoding device and being output from the storage device. For 
example, with a 0.5 % difference in rate, the storage should be able to minimally store at least 
0.5% of a video title in the situation where supply of the video title from the storage is started 
immediately. Preferably, the storage 24 is able to store at least one average size video title in 
fiiU to enable later decoding/rendering at a time desired by a user. The storage 24 may be a 
removable type of storage medimn, such as DVD+R, DVD+RW, DVD-RW, DVD-RAM, 
DVHS, Blu-ray disc, solid state memory, etc. Preferably, the storage 24 is a high capacity 
storage, such as a hard disc, capable of storing several average size video titles in full. 

Encoding device 

Fig.2 shows an exemplary block diagram of the encoding device. In this 
example, the encodmg device includes an A/D converter 210 for sampUng a video signal v.j, 
received through an input 205 and converting it to a digital representation. The incoming 
video signal has an associated vclk clock signal (typically 27 MHz.). The sampling is line 
locked to this clock signal. The incoming video signal also includes a vsync pulse (typically 
25 Hz. Or 30 Hz.) that marks the start of a new video frame. The digitized video data and the 
vsync signal are supplied by the A/D converter 210. The encoding device further includes an 
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encoder 220 and an encoder clock 230. The encoder clock 230 is free running according to 
the requirements of the video encoding/transmission standard. For MPEG, the encoder 230 is 
clocked from a crystal, which delivers a very precise 27 MHz signal (must be 27 MHz +/-30 
ppm according to the MPEG standard). The clock signal is used to increment a counter 245. 
The PGR time-base counter (Program Clock Reference) is directly derived &om this main 
encoder clock. PGR is thus a counter value. Using the predetermined frequency, a difference 
in counter values can also be expressed in a time difference. To simplify examples, in the 
remainder sometimes differences between PGR's will be expressed in time instead of the 
actual difference in coimt^ values. The same holds for other counters/time-stamps, such as 
PTS. In tiie embodiment of Fig.2, each time a new video frame arrives at the video input 205, 
a time stamp V-ITS (Video Input Time Stamp) is created by sampling the clock counter 
value. Some or all of these time stamps will be reflected in the digital video stream via the 
communication system 40 in the form of the PTS-Presentation Time Stamp. PTS specifies to 
an encoder/renderer when the associated video frame should be supplied for rendering. The 
PTS value is thus the 'same' as V-ITS with a fixed delay representing the delay in the entire 
processing chain. To be able to communicate via the communication system, the encoding 
device 10 includes a commxmication interface 280. Hardware and/or software for such 
interfaces are well-know and will not be described here fiirther. 

In the system according to the invention it is allowed that there is a 
disturbance in the source, in the sense that the line locked (27 MHz) clock has a lot of jitter 
on a line by line basis (but is locked on a frame basis), and that the frequency of the vsync 
signal deviates considerably from the expected value (e.g. 24.5 Hz. instead of 25 Hz.). Such a 
situation may, for example, occur when the input signal of the encoding device is received 
from a VGR at playback (or even worse: trickmode). 

In an embodiment as is illustrated in Fig.3, in addition to a video input signal 
V.S., the encoding device also has an input 240 for receiving an audio signal a. v.. In this 
example, the audio signal is still analog. The encoding device thus includes an A/D converter 
250 for sampUng the audio signal a,s. and converting it to a digital representation (stream of 
audio frames/packets). In principle a free running sampling clock, running at for example 48 
KHz. could be used. However, to keep the audio synchronous to the video, also after during 
fluctuations m the timing of the video signal and after time-base correction of the digital 
video stream, in a preferred embodiment, the audio sampling clock signal is derived from the 
video input signal and locked onto the input frame signal (vsync). In the example of Fig.3, 
this is achieved by using a PLL (Phase Locked Loop) 260 locked to vsync. The clock 
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supplied by the PLL is divided to the frequency required for the audio sampling (e,g. dividing 
the clock frequency by a factor of 27 MHz./48 KHz =562.5). The audio frames/packets are 
also time stamped on receipt using associated A-ITS (Audio Input Time Stamps). Similar to 
what was described for the video, some or all of these A-ITS values may be represented in 
5 the video stream in the form of respective audio presentation time stamps (A-PTS) to steer 
the supply of the audio from the decoder/renderer. 

It will be appreciated that there may be more than one video input signal and 
that there may also be several audio signals (e.g. stereo, multi-channel, different languages, 
etc.). Sampling/time-stamping of each of these signals is preferably synchronous to vsync of 

10 the main video signal. The various resulting streams of digital packets may be multiplexed 
into one transport stream by the encoder 220. Multiplexing/demultiplexing is well-known and 
will not be described here any fiuUier. The encoder 220 (or more precisely, the multiplexer of 
the encoder) inserts samples of the PGR time-base into the stream. The PGR values may be 
inserted at fixed intervals, e.g., every 40 msec. In reality, most encoding/transmission 

15 standards only define a maximum offset that is allowed between immediately successive 
PCR's. For example, DVB specifies that at least once per 40 msec, a PGR value must be 
inserted. Typically, PGR values are not inserted at regular intervals. In a decoder/renderer, 
such as shown in Fig.l using numbers 30 and 50, the received values of the PGR are used to 
reconstruct the encoder PGR counter. In this way, the decoder clock runs synchronous to the 

20 encoder clock. Gonsequently, frames can be issued at the right time, using its PTS value. 

Transmission standards define a maximum deviation in the clock signal used for creating the 
PGR's to guarantee that all decoders can reconstruct the clock signal. According to MPEG, 
the decoder may not be able to reconstruct the PGR counter if the PGR clock is outside the 
MPEG specifications of 27 MHz +/- 30 ppm. Since the encoding device fiilly complies with 

25 the involved standard, it can be assumed that the reconstruction is perfect, i.e. the PGR 
counter in the decoder is a copy of the PGR counter in the encoder. 

Because of the fact, that the video vsync is not locked to the main encoder 
clock, the difference between the PTS's of two successive video access units is generally not 
precisely equal to the expected 40 ms (in the example of 24.5 Hz it is larger). Similarly, the 

30 difference between 2 audio access units will generally not be precisely equal to the expected 
value, because the audio clock is also not locked to the PGR clock. In the decoder, the PTS's 
that are coded in the stream are used to determine the pres^tation time of access units in 
relation to the value of the reconstructed PGR counter. In practice artifacts may occur in 
decoder/renderers. For example, a conventional decoder/renderer may use a single clock. 
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locked to the incoming PGR, for audio and video decoding. It may start decoding of 
audio/video access units at an expected pace, independent from the precise PTS values that 
are in the stream. For example, for PAL the decoder may start decoding a new frame after 
every 40 ms of the PGR clock. Before decoding of a particular frame, the decoder will then 
5 check whether or not its PTS is still within a distance of 1/2 frame period from the really used 
decoding time Tdec (Tdec follows from the assumed fixed-pace decoding). If not, the decoder 
makes the following correction: 

IF (Tdec - PTS > 1/2 frame_period) THEN skip_access_unit; 

IF (Tdec - PTS < -1/2 framejperiod) THEN repeat_access_unit; 
10 In this way, the decoder still effectively locks the presentation times to the PTS's, but 
artifacts will occur at regular times if the audio and video were not already locked to the 
PGR. Nevertheless, the incoming signal is precisely reconstracted: all audio/video packets 
are there. However, the decoder will have to skip and/or repeat access units for rendering. 
The sj^tem according to the invention overcomes this problem. 

15 

Storage device 

Fig. 4 illustrates the solution in a conceptual way, using a storage device 400 
(same as storage device 20 of Fig. 1) acting as a recoder. To explain this, the storage device is 
assumed to contain a (e.g., MPEG) decoder 410 and an additional encoder 420. The encoded 
20 input stream received via the comimunication system through a conamunication interface 405 
is ftiUy compliant with the encoding/transmission standard as described above. Note, that the 
audio and video PTS's are again derived by adding a constant delay to the ITS's. The ITS's 
are samples of the PGR time-base, so they are also derived from the precise clock at, for 
example, 27 MHz. 

25 In the storage device, the received signal is decoded by a (conceptual) decoder 

410. Because of the compliant input stream, the decoder 410 is able to exactly reconstruct the 
signal as it came into the encoder of the original encoding device, e.g. number 10 of Fig. 1 . 
This means that also the vsyncs that are produced by the decoder based on the PTS's are a 
copy of the vsync as it came into the original encoder. Inside the storage device, there now is 

30 a second (conceptual) encoder 420, which re-encodes the decoded signal, thereby locking its 
time-base to the incoming vsyncs. In the embodiment of Fig.4 the locking is achieved using a 
PLL and VCO (Voltage Gontrolled Oscillator). The VGO drives the modified PGR counter 
used by the encoder 420. The stream that comes out of this encoder 420 contains all AA^ 
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data, but its time-base has changed. By storing it in storage 430 and playing it back from the 
storage at the modified time-base, a fiilly compliant signal with new time-base is achieved. 

In practice, it is not required to use a full-blown decoder 410 and encoder 420 
in the storage device 400. In fact, the actual AA^ packets are not changed at all and, thus, do 
5 not need to be decoded and re-encoded. Only, the main time hasp, defined by the PCR's is 
changed and all stream fields that depend on it. In particular, Presentation Time Stamps 
(PTS) and, if present. Decoding Time Stamps (DTS). Thus, the same can be achieved using 
some additional processing on incoming PTS and PGR timestamps. This processing may be 
done using only software executed on a processor, such as a microcontroller or digital signal 
1 0 processor. Such processing may be done in real-time or ofiF-line. As an altemative to fiiU 

software-based processing, partial hardware support may be used as will be described below 
in more detail. 

Fig.5 illustrates that according to the invention, the time-base modifier 510 in 
the storage device 400 scales the time-base (i.e. PCR's) such that the modified PTS's have 

15 values reflecting a constant predetermined frame-time between successive frames. In this 
figure, same reference numbers as used in Fig.. 4 refer to the same features. For a 25 Hz. 
system, the predetermined frame-time is 40 msecs. It should be noted that not all firames need 
to have a PTS. So, successive PTS's may have values reflecting a multiple of the frame-time. 
Li the examples below, it is assumed that each frame is associated with a respective PTS to 

20 simplify the description. Thus, the time-base modifier replaces the video presentation time 
stamps (PTS) by respective modified video presentation time stamps (M-PTS) based on the 
constant predetermined firame-time between successive presentation time stamps. For 
example, numbering the successive PTS's using an index j, 
M-PTS[/+1]= M-PTS[/-] + frame-time, or 

25 M-PTS|>-] = PTS[0] +y X frame-time 

The time-base modifier replaces the program clock reference stamps (PGR) by 
respective modified program clock reference stamps (M-PCR) by scaling the program clock 
reference stamps (PGR) using a scaling factor that depends on a ratio of an expected time 
delay between a video presentation time stamp j and a preceding video presentation time 

30 stampy-n and an actual time between the video presentation time stamp J and the preceding 
video presentation time stamp 7-/2, where J>n>0, and the expected time is n times the 
predetermined firame-time. The period over which the scaling extends, reflected by n, may 
be chosen. In one extreme, n^J. In such a case, the scaling factor at the time of PTSQ"] is 
given by: 
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(nxframe-time)/ (PTS[j]-PTS[0]), 
where PTSjj] represents the time of the >th PTS based on its counter value. As more PTS's 
are received the intervals increase and the scaling factor gets more stable. In another extreme, 
the scaling factor is highly dynamic based on the actual time in between two immediately 
5 successive PTS's (n=l). In such a case, the scaling factor at the time of PTSD] is given by: 
frame-time/ (PTS|j]-PTSD-l])- 
Also other choices for n may be made (e.g. n=5 PTS periods). In the remainder an example 
will be given for w=/ Persons skilled in the art will be able to apply the same technique for 
other values of «. 

10 For each received PTS (indicated as recPTS with index between bracket [], an 

expected PTS (indicated as expPTS) is determined with which the received value is 
compared. For initiaUzation, the first expected PTS is set equal to the first received PTS: 

expPTS[0]=recPTS[0] 
All following expected_PTS's are calculated by adding the constant firame-time to the 
15 previous expected_PTS: 

expPTS[/+l] = expPTSDl + frame-time, 7>0. 
For example for a PAL video system, a new expPTS is always set 40ms higher than its 
predecessor. 

For each received PGR value PCR[^] in the transport stream, a target T[k] is 

20 derived as: 

r[A:]= {PCR[kyPCR[0])x scalingfactor + PCR[o] (1) 

where 

.r.nnnfM ^xp cxp Pr^[o] _ jy^ frame ^ time 
scalingfactor^ recPra[/]- reci>ra[o] " recPTS\jyrecPTS[o] 

where k denotes the index of the received PGR and j corresponds to the number of PTS 
25 values received before the PGR value. The value ofj must be bigger than 0 before the 

algorithm can start working. The target T{K\ can now be used as the modified PGR, indicated 
asMPCR: 

MPCRC;t]=T[Ar] 

Similarly, the modified PTS, indicated by MPTS, is fiilly corrected and is thus given by: 
30 MPTSW = expPTSW. 

Typically, there will be more PGR samples than PTS samples in a stream. This leaves the 
question what J (index for PTS) should be used for modifying a PGR indicated by index k. 
Particularly, if the modification is real-time, it will be clear that the scaling factor has to be 
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known before the scaling of PGR can take place. To optimally reflect a recent change, 
preferably a given PGR is modified based on a scaling factor derived firom at least the 
immediately preceding PTS. Since there may be more PGR samples then PTS samples, there 
may be several PGR samples that are modified with the same scaling factor. For off-line 
correction, the same principle can be applied. However, in such a case a scaling factor 
determined based on PTS[/] and PTS[/-n] can also be used for correcting PGR samples that 
lie within this interval. 

Above algorithm shows that the modified clock base can be calculated off-line 
(i.e. non-real-time) or real-time without really reconstmcting a time signal. 

In an altemative embodiment, as shown in Fig.6, the time-base modifier 610 
reconstructs a scaled time signal using a PLL. Same reference numerals as used in Figs.4 and 
5 refer to same functions. To steer the PLL, an error signal E is derived as: 

B[k] = time-base[A:] - T[^], 
where time-base[/c] is the sample of the time-base counter that was taken when receiving 
PGR packet As a start-up the following can be used: 

tmie-base[0]=PGR[0] 
All the calculations, including the division ui the scaling factor do not have to be very 
precise, since tiie result is only used to steer the PLL. Note, that the Equation (1) in essence 
aims at locking the PLL to the average firame-time (running average firom frame 0 to firamey). 

In a preferred embodiment, a low-pass filtered version of the frame-time is 
used to lock the PLL. This may be done in any suitable way. For example, the time-base 
modifier may low-pass filter the received video presentation time stamps and the clock unit is 
locked to the filtered video presentation time stamps. Alternatively, the time-base modifier 
low-pass filters the scaling factor of equation (2). 

The error signal E is used by the PLL to modify the frequency of the main 
encoder clock (e.g. 27 MHz clock). If E>0 then the frequency is decreased, if E<0 then the 
frequency is increased- The goal of the PLL is to control the error signal to zero. 

That above mentioned approach indeed leads to a lock onto the vsync can be 
understood by using an example that shows the start-up behavior of the PLL. In Fig.7 the 
timing of a received transport stream is depicted which contains video frames that originally 
came into the encoder at 21.7 Hz (=1/0.046). It should be noted that this is an exceptionally 
low frequency, used to illustrate the concept. The x-axis shows the real time, which for the 
example is assumed to correspond to the time according to the precise 27 MHz time-base in 
the encoder. In tide example, PGR values are present at regular intervals once every 40 ms. In 
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reality, this may not be the case. The first PGR value corresponds to a time of 210 ms, and 
since the encoder is assumed to use a perfect time-base, the values in the PGR packets are the 
same as the time at which they are received. Several (e.g. 5 or 12) PTS values may be 
present in the stream per MPEG GOP (group of pictures). However, at what time these PTS 
values are received depends on the sizes of the particular frames in the GOP. E.g. since the 
first frame (=I-frame) is noraially the largest, the difference in receival time between the first 
and second PTS in the stream is normally relatively large. The values of the PTS timestamps 
depend on the time at which the corresponding firame was received at the input of the 
encoder. In the example, each successive PTS has a value that corresponds to 46 msecs. later 
than its predecessor. The value of the first PTS is chosen in the encoder according to the 
desired vbv_delay. 

In FIG. 8, the timestamps that are modified according to the above mentioned 
algorithm are shown in boxes. Furthermore the value of the PLL em^r signal E[j] is shown. 
As can be see in the figure, all PTS's are overwritten by their expected value (with a fixed 

15 frame-time of increase per frame, e.g., 40 ms). The PLL error signal can only be calculated 
after 2 PTS values are received (/=1). Until then the error signal is 0 and the VCO runs at an 
uncorrected 27 MHz. The first error signal unequal to 0 is calculated at t=290 ms. Before that 
time, the PGR is overwritten by a free running time-base (for now we assume again that this 
time-base runs exactly at 27 MHz), so that the PGR is not changed. In the example, the 

20 calculation is again in actual time and not in corresponding clock values. The target T[2] for 
PGR[2] at t=290 equals: 

^ , jy^frametime 1x40 

scalingfactor = r i n = =0.87 

recPTS\j\-recPTS^] 406-360 

T\l]={PCR\z]'-PCR[0\^>^scalm^ 
Thus: 

25 MPTS[/] = 360+j x 40, for j>0 

Using a locked PLL, the error signal equals 

E = 290 -279.6= 10.4 ms 
Because of this positive error signal, the PLL will slow down the clock. For the next PGR 
packet, this will lead to a reduced error. For example, the PLL may be controlled in a 

30 proportional manner, where its frequency/is given by: 

/=/ox[l~;^j,/o = 27MHz. 

Then: 
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AiPCR[3]=MPCR[2]+ {PCR[3]- Pa?[2])x j^l 
=290+40-5=319.6 




and 



MPCR[4]=AdPCR[3]+ {PCR[4]- PCR[3]}x ^1 
=319.6+40-5=319.6+40-5.3=354.3 




The corresponding target is: 



r[4]= {PCR[4]- PCR[0])x scalingfactor + PCR[o] 
= (370 - 210)x (480 - 360)/ (498 - 360)+ 210 = 349. 1 



(1) 



The error is then: 

E[^] = time-baseM - T[<| = 354.3-349,1=5.2. 
Using this proportional arrangement, the final error is reached if/=40/46 >^fo.ln this case: 



will have reached a stable situation with an error very close to zero. This is illustrated in 
Fig.9. The solid line represents the value of the time-base counter as a function of the real 
time. Before t=290 ms, the time-base is free running at 27 MHz. After that, the PLL slows 
down the clock, and the time-base approaches ttie ideal vsync locked time-base (dotted). This 
ideal time-base ran at 40/46 of 27 MHz from the begiiming. The crosses on the ideal curve 
show received PTS values normalized to the value of recPTS[0]. Once the PLL is locked, a 
situation is reached where, at playback, there is a time-base corrected output signal with 
vsync precisely equal to 25 Hz, 

Storage time stamps 

Since not all packets in the stream need to be associated with a PTS value and 
in certain applications it is desired to release packets from storage with exact time differences 
reflecting time differences of the packets before they were stored, frequently all packets are 
time stamped by the storage device. In such a way, the transport stream TS is transferred into 
a time-stamped transport stream TTS. The time stamping also uses flie clock that is based on 
the PCR's. In fact, in the storage device the reconstracted PGR values could be sampled and 




j_^^40 
40 46 



By using an integrator, as is well known in the art, after some time, the PLL 
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used for time stamping received packets. Since the format of the PGR is not ideal for storage 
purposes, usually an additional TS counter is used. The modification of the PGR time base 
should now also be applied to the TS time base. In the embodiment where a PLL is used, this 
is now simple. The PLL clock steers the TS counter and this counter is sampled and added to 
a packet received by the storage device. All the processing in the storage device can be 
steered by using the PGR-locked main clock. All TS packets that come mto the recorder are 
time-stamped from the TS time-base in the TS->TTS block. The resulting stream, called a 
time-stamped transport stream (TTS), contains 4 bytes of timestamp data and 188 bytes of 
video data per packet. At playback, the recorder uses a very precise 27 MHz clock, within 
the 30 ppm limits of MPEG. For each TS packet, the time- stamp that was added during 
recording is used to determine the time it has to be sent out in the TTS->TS block. This is 
done by comparing the time-stamp with a coimter that is clocked by the 27 MHz clock. The 
TTS->TS block contains a suflBcient amount of buffer memory to hold data that is not yet 
meant to go out. 

In an alternative embodiment, in a system without a PLL the same equations 
(1) and (2) can now be appUed for TS[k] instead of PGR[^]. It is required tiiat there is akeady 
a TS coupled to the PGR's in the stream. This can be done in the following way. As in the 
conventional decoder/storage device, a clock is used that is locked to incoming PGR values, 
thus reconstructing the encoder clock. The packets are time-stamped using a counter driven 
by this clock. In such an embodiment, the incoming steam must be real-time, using an 
isochronous communication channel. The time stamps are then modified as described above. 
Again, this time base modification can be done off-line and need not be real-time. 

In an embodiment, no incoming time stamps are used for the packets. Instead, 
the recording time stamp is estimated. This can be done at recording or playback from the 
storage. The storage already has modified PGR values. Packets that lie in between these 
PGR's are assigned a value based on these modified PGR's. A simple way of assigning a 
time is to use a proportional distribution over the time interval. So, if there are four packets in 
between two successive PGR's, the tune in between the PGR's is divided into 5 equal period. 
Starting from the first PGR, each time when the period has elapsed, the next packet is 
released from storage. 

It will be appreciated that tiie invention also extends to computer programs, 
particularly computer programs on or in a carrier, adapted for putting the invention into 
practice. The program may be in the form of source code, object code, a code intemiediate 
source and object code such as partially compiled form, or in any other form suitable for use 
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in the implementation of the method according to the invention. The carrier be any entity or 
device capable of carrying the program. For example, the carrier may include a storage 
mediimi, such as a ROM, for example a CD ROM or a semiconductor ROM, or a magnetic 
recording medixmi, for example a floppy disc or hard disk. Further the carrier may be a 
transmissible carrier such as an electrical or optical signal which may be conveyed via 
electrical or optical cable or by radio or other means. When the program is embodied in such 
a signal, the carrier may be constituted by such cable or other device or means. Alternatively, 
the carrier may be an integrated circuit in which the program is embedded, the integrated 
circuit being adapted for performing, or for use in the performance of, the relevant method. 

It should be noted that the above-mentioned embodiments illustrate rather than 
limit the invention, and that those skilled in the art will be able to design many altemative 
embodiments without departing from the scope of the appended claims. In the claims, any 
reference signs placed between parentheses shall not be construed as limiting the claim. Use 
of the verb "comprise" and its conjugations does not exclude the presence of elements or 
steps other than those stated in a claim. The article "a" or "an" preceding an element does not 
exclude the presence of a plurality of such elements. The invention may be implemented by 
means of hardware comprising several distinct elements, and by means of a suitably 
programmed computer. In the device/system claims enumerating several means, several of 
these means may be embodied by one and the same item of hardware. The mere fact that 
certain measures are recited in mutually different dependent claims does not indicate that a 
combination of these measures cannot be used to advantage. 



